An Improved System for Exon Recognition and Gene Modeling in Human DNA Sequence
نویسندگان
چکیده
A new version of the GRAIL system (Uberbacher and Mural, 1991; Mural et al., 1992; Uberbacher et al., 1993), called GRAIL II, has recently been developed (Xu et al., 1994). GRAIL II is a hybrid AI system that supports a number of DNA sequence analysis tools including protein-coding region recognition, PolyA site and transcription promoter recognition, gene model construction, translation to protein, and DNA/protein database searching capabilities. This paper presents the core of GRAIL II, the coding exon recognition and gene model construction algorithms. The exon recognition algorithm recognizes coding exons by combining coding feature analysis and edge signal (acceptor/donor/translation-start sites) detection. Unlike the original GRAIL system (Uberbacher and Mural, 1991; Mural et al., 1992), this algorithm uses variable-length windows tailored to each potential exon candidate, making its performance almost exon length-independent. In this algorithm, the recognition process is divided into four steps. Initially a large number of possible coding exon candidates are generated. Then a rule-based prescreening algorithm eliminates the majority of the improbable candidates. As the kernel of the recognition algorithm, three neural networks are trained to evaluate the remaining candidates. The outputs of the neural networks are then divided into clusters of candidates, corresponding to presumed exons. The algorithm makes its final prediction by picking the best canadidate from each cluster. The gene construction algorithm (Xu, Mural and Uberbacher, 1994) uses a dynamic programming approach to build gene models by using as input the clusters predicted by the exon recognition algorithm. Extensive testing has been done on these two algorithms.(ABSTRACT TRUNCATED AT 250 WORDS)
منابع مشابه
Identification of Novel Mutations in IL-2 Gene in Khorasan Native Fowls
The intron-exon structure of Khorasan native fowl interleukin-2 (IL-2) was investigated. For this purpose, twenty chickens were selected from the Native Fowl Breeding Station of Khorasan province, and genomic DNA was extracted using a modified conventional DNA extraction protocol. An 875 bp fragment of IL-2 was successfully amplified, including a small part of the promoter, exon 1, intron 1, an...
متن کاملO-36: Evaluation of Genetic Variations in Intron 4 and Exon 5 of RABL2B Gene in Infertile Men with Oligoasthenoteratospermia and Immotile Short Tail Sperm Defects
Background One of the main causes of male infertility is defect in structure and function of sperm cells. Infertile men with oligoasthenoteratospermia (OAT) defect, have sperms with abnormalities in count, motility and morphology. Patients with immotile short tail sperm (ISTS) disorder have immotile short-tailed sperm with disorganized axonem, and a significant decrease in sperm counts. Numerou...
متن کاملSequencing and Bioinformatics Analysis of Kappa-Casein Exon 4 Gene in Iranian Bacterianus and Dromedaries Camels
Kappa-casein, as a major protein component in mammalian milk, plays an essential role in formation and stabilization milk micelles and preventing them from aggregating and therefore, helping to keep calcium phosphate in solution and transfer of calcium and phosphors from animal milk to consumers. Therefore, the objective of the current study was to investigate genetic and phylogenetic analysis ...
متن کاملIsolation and Characterization of a New Peroxisome Deficient CHO Mutant Cell Belonging to Complementation Group 12
We searched for novel Chinese hamster ovary (CHO) cell mutants defective in peroxisome biogenesis by an improved method using peroxisome targeting sequence (PTS) of Pex3p (amino acid residues 1–40)-fused enhanced green fluorescent protein (EGFP). From mutagenized TKaEG3(1–40) cells, the wild-type CHO-K1 stably expressing rat Pex2p and of rat Pex3p(1–40)-EGFP, numerous cell colonies resistant to...
متن کاملC26232T Mutation in Nsun7 Gene and Reduce Sperm Motility in Asthenoteratospermic Men
Reduced sperm quantity and motility are primary causes of infertility in men. Before researchers showed that, Nsun7 gene has roles in sperm motility of mouse, that creation defect in this gene is cause infertility. This gene in human located in chromosome 4, with 12 exons and a hot spot exon (exon7). Our aim is study of the mutations of the exon7 in the normospermic and asthenoteratospermic men...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Proceedings. International Conference on Intelligent Systems for Molecular Biology
دوره 2 شماره
صفحات -
تاریخ انتشار 1994